ON THE SPEED OF CONVERGENCE OF NEWTON'S METHOD 
FOR COMPLEX POLYNOMIALS 



TODOR BILAREV, MAGNUS ASPENBERG, AND DIERK SCHLEICHER 



Abstract. We investigate Newton's method for complex polynomials of arbitrary 
degree d, normalized so that all their roots are in the unit disk. For each degree d, 
we give an explicit set Sd of 3.33d log^ d(l+o(l)) points with the following universal 
property: for every normalized polynomial of degree d there are d starting points 
in Sd whose Newton iterations find all the roots. If the roots are uniformly and in- 
dependently distributed, we show that the number of iterations for these d starting 

^ ^ points to reach all roots with precision e is 0(d^ log* d + d log | log £| ) (with prob- 

ability pd tending to 1 as d — ^ oo). This is an improvement of an earlier result in 
D |Sch] ■ where the number of iterations is shown to be 0{d* log^ d+d^ log^ d| logs]) in 

the worst case (allowing multiple roots) and 0(d^ log'^ d(logd+log(5) + dlog | loge|) 
for well-separated (so-called 5-separated) roots. 

T-H Our result is almost optimal for this kind of starting points in the sense that 

the number of iterations can never be smaller than O(d^) for fixed e. 



1. Introduction 



^ Newton's root finding method is an old and classical method for finding roots of a 

differentiable function; it goes back to Newton in the 18th century, perhaps earlier. 
It was one of the main reasons why A. Douady, J. Hubbard and others in the late 
J> 1970s studied iterations of complex analytic functions. The main question was to 

J;^ know where to start the Newton iterative method in order to converge to the roots 

of the polynomial. Newton's method is known as rapidly converging near the roots 
(NJ (usually with quadratic convergence), but had a reputation that its global dynamics 

was difficult to understand, so that in practice other methods for root finding were 
used. See [RJ for an overview on recent results about Newton's method. 

Meanwhile, some small sets of good starting points are known: there are explicit 
. . deterministic sets with 0{dlog^ d) points that are guaranteed to find all roots of 

appropriately normalized polynomials of degree d [HSS], and probabilistic sets with 
^ as few as 0((i(log log d)^) points [BLSJ. 

^ We are interested in the question how many iterations are required until all roots 

are found with prescribed precision e. In |Sch| . it is shown that among a set of 
starting points as specified above, there are d points that converge to the d roots and 
require at most 0{d'^ log^ d + d^ log^ d\ loge|) to get e-close to the d roots in the worst 
case; for randomly placed roots (or for roots at mutual distance at least S for some 
5 > 0), the required number of iterations is no more than 0{d^ log^ d + dlog | loge|) 
(with the constant depending on 5). This is about one power of d away from the 
best possible bounds. 

In this paper, we show that Newton's method is about as fast as theoretically 
possible. We consider the space of polynomials of degree d, normalized so as to have 
all roots in the complex unit disk D. Our main result is the following. 
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Theorem 1 (Quadratic Convergence in Expected Case). For every degree d, there 
is an explicit universal set of points in C, with \Sd\ = 3.33(ilog^ d(l + o(l)), 
with the following property: suppose that ai, . . . , a„ are uniformly and independently 
distributed in the unit disk and put p{z) = 11^=1 (-^ ~ ^j)- Then there are d starting 
points in Sd such that with probability Pd ^ 1 as d —t- oo, the number of iterations 
needed to approximate all d roots with precision e starting at these d points is 

0{d^ log^d + dlog|loge|). 

Remark 1. As stated, the theorem deals with d distinguishable (i.e., ordered) roots 
and their associated probability distribution. We prove that the same result holds if 
we identify our polynomials in terms of their sets of indistinguishable roots, as two 
polynomials p{z) = 11^=1 (-^ ~ ^j) ^^"^ — 11^=1 (-^ ~ l^j) ^^'^ same if their 
unordered sets of roots {ai, . . . , ad} and . . . , fid} are equal. 

Remark 2. This bound on the number of iterations is optimal in the sense that there 
is no bound on the number of iterations in the same generality that for fixed e has 
asymptotics in o{d?), so we are away from the best possible bound only by a factor 
of about 0(log^ d). 

2. Good starting points for Newton's method 

Studying the geometry of the immediate basins outside the unit disk D, in |HSSj 
we proved the existence of a universal starting set with l.lldlog^ d points depending 
only on d such that for every polynomial of degree d with all roots in the unit disk, 
and for every root, there is a point in the set which is in the immediate basin of this 
root. Enlarging the set by a factor of 3 approximately, in |Schj we obtained a set of 
starting points Sd which ensured that for each polynomial p and each root a there is 
a point z in Sd intersecting the immediate basin [7 of a in the "middle third" of the 
"thickest" channel, where a channel is an unbounded connected component of C/\D. 
Being in this middle third implies an upper bound on the displacement du{z, Np{z)) 
in terms of the Poincare metric of the immediate basin. It also turns out that the 
orbit of z under iteration of Newton map does not leave Djj(O), the disk of radius R 
centered at the origin for some bounded value of R. We will refer to such points as 
having R-central orbits. 

More precisely, let Sd be defined as follows. 

Definition 2 (Efficient Grid of Starting Points). For each degree d, construct a 
circular grid Sd as follows. For A; = 1, . . . , s = [0.4 log d] , set 

rfc = (1 + \/2) ( — 1 

and for each circle around of radius r^, choose [8.33(ilogd] equidistant points 
(independently for all the circles). 

The set Sd thus constructed has 3.33(1 + o{\))d\o^ d points. The following the- 
orem is proven in jSchl Theorem 8]. 

Theorem 3. For each degree d, the set Sd has the following universal property. If p 
is any complex polynomial, normalized so that all its roots are in D, then there are d 
points z^^\ . . . , z*-'^-* in Sd whose Newton iterations converge to the d roots of p. If a 
is a root of p and U is the immediate basin of a, then there is an index i such that 
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z^*) G U with du{z^''\Np{z^^)) < 21ogd. In addition, z^^\ . . . , z^'^'' have R-central 
orbits for 

^ X r5T(logd+i)l 



R<5 



d-l 



For d = 100, we have R < 14; for d = 1000, we have R < 7.5; and asymptotically 
the upper bound on R tends to 7. 

The result provides an upper bound for R that is uniform in d. This set of starting 
points will be the basis for the discussion which follows. 

3. Uniformly distributed roots 

In this manuscript we investigate the Newton map for complex polynomials with 
randomly distributed roots. In this section, we fix notation and give the strategy of 
the proof of our main result, Theorem [T| 

Let a be a simple root of the polynomial p of degree d and U be the immediate 
basin of attraction of a. By the discussion in the previous section, there exists 
zi G Sd with i?-central orbit in U, i.e. under iteration of the Newton map Np the orbit 
converges to a and stays within Dji{0). Let Zn+i := Np{zn) for n > 1. For any two 



consecutive points z„ and z^+i along the orbit of zi, in [Schl Section 4] we constructed 
"thick" curves that, roughly speaking, "use up" area at least \zn — z„+i|^/(2t) with 
T := dij{zi,Z2) < 2\ogd. In the region of quadratic convergence (near the root a), 
only log2 I log2 e — 5| iterations are sufficient to get e-close to the root. Outside this 
region, two such curves with base points Zn and Zn' are disjoint if n' — n > 2t + 6 
|Scht Lemma 11]. The bound 0{d^ log^ d+d log | loge|) follows from lower bounds on 
the displacements \zn — Zn+i \ along the orbit. The main improvement in this paper is 
on the lower bounds on the displacements when the roots are randomly distributed. 

As in |Schj . we partition -D_r(0) (the disk of radius R centered at the origin) into 
domains 

5fc := |z G Dr{0) : min \z - Ojl G (2-^''+^\ 2'^ | , A: G Z . 

It turns out that if the roots are randomly distributed in the unit disk, then with 
high probability the following holds: there exists a universal constant C such that 
for every n we have the following estimates 

I, , I > / if e Sk with k < log2 d- 

\Zn - Zn+l\ ^ \ c 4.U 

i Wk otherwise. 

If Zn G Sk with k < log2 d, then we say that we are "in the far case" , as z„ is far from 
all the roots. Since each such iteration uses an area of at least |z„ — (2r), and 

one in 2r + 6 such areas are disjoint, the total number of orbit points in the far case 
is bounded by 0{d'^ log^ d). 

[Sch', Lemma 16] says that if the orbit gets very close to some root in comparison to 
the other roots, then it has entered the region of quadratic convergence of that root 
where only log2 | log2 e — 5| are sufficient to approximate it within an e-neighborhood. 
We call this "the near case" . 

For randomly distributed roots, the mutual distance between roots is large enough 
so away from the region of quadratic convergence, we only need to consider k < 
3 + (2 + r/) log2 d for a certain r/ > 0. We define the "intermediate case" as those 
Zn G 5"^ with log2 d < k < 3 + {2 + ri) log2 d. Each domain Sk has area 0{dA~''), and 
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each iteration with Zn G uses area about {C /2^k)'^ /2t « C'^/A^k'^r, so the number 
of orbit points in the intermediate case is at most 0{dk'^T) for each k, times the usual 
factor 2r + 6 to make the areas disjoint. But log2 d<k<3 + {2 + r]) log2 d and 
r = O(logd), so the total number of iterations in the intermediate case is 0{dlog^ d). 
In the subsequent sections we will make these arguments precise. 

3.1. On the distribution of the roots. In order to get a lower bound on the 
expected displacement, we will first investigate the distribution of the roots. We 
will be interested in two different kind of probability spaces. The first space Vd = 
{(xi, . . . , Xd) : € O} consists of all polynomials with d distinguishable roots in the 
unit disk, normalized so as to have leading coefficients 1, and the probability measure 
is induced by the Lebesgue measure on D'^. The second space Vd/Sa consists of all 
polynomials with indistinguishable roots in the unit disk, i.e. the quotient probability 
space of the standard action of the symmetric group Sd on Vd defined by permuting 
the roots. 

The following lemma is the probabilistic ingredient of the main theorem. It cer- 
tainly isn't new, but easier verified than looked up in the library. 

Lemma 4 (Base-d numbers). Let Md be the set of all d-digit numbers in base d. 

(a) The probability that that a randomly chosen number a G Md does not have a 
digit repeating more than 0{logd) times is at least 1 — 1/d. 

(b) Let ~ be an equivalence relation on Md defined as follows: a ^ b 3a ^ Sd 
with a = ah, i.e. two elements are equivalent if they have the same sets of digits 
counted with multiplicities. Then the probability that a randomly chosen element 
[a] G Md/ ~ does not have a digit repeating more than O(logd) is at least 1 — 1/d. 

Proof, (a) For fixed i, the number of d-digit numbers which contain at least a digits 
i is at most Thus the number of d-digit numbers which contain a symbol 

repeating at least a times is at most 

\aj a! 

So the probability that a randomly selected number in Md contains at least a identi- 
cal digits is at most ^ since \Md\ = d'^. Therefore, with probability at least 1 — ^, a 
randomly selected number in Md does not have a digit repeating more than a times. 

Note that if a! > d^ we have 1 — ^ > 1 ~ 3- Therefore, by taking a such that 
{a — 1)! < d^ < a\ (which implies that a G O(logd)), we prove the first part of the 
claim. 

(b) Note that the elements of Md/ ~ can be bijectively mapped to the set Mult^ = 
{{xo, . . . , Xd~i) '■ Xi G Z>o, xo + . . . + Xd-i = d} as follows: for [a] G Md/ ~ let Xi be 
the multiplicity of digit i in every a G [a]. It is well known and easy to see that 

(1) \{{xq, . . .,Xr-l) ■ Xi G Z>o,Xo +Xi + ... +Xr-1 = 7T,}| = 

Thus we have |Multd| = {^d-i) • other hand, the number of elements in Mult^ 

with first component at least a is equal to the cardinality of 

{{x2, ...,Xd):xie Z>o, X2 + . . . + Xd < d - a} 

which has the same cardinality as 

{{yi,X2, ...,Xd): yi,Xi G Z>o,yi + X2 + . . ■ + Xd = d- a} . 



n-\- r — 
r-1 
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Again by (ij) this quantity equals to {^"^^^^i^)- Therefore, the number of elements of 
Multrf with a component at least a, i.e. the number of elements of M^/ ~ with a 
digit repeating at least a times, is at most 

'2d- a - V 
d-1 



d 



Hence the probability that a number of M^/ ~ has a digit repeating at least a times 
is at most 

dC^d-i^) d{2d -a- l)\{d - l)\d\ 



(2d-l)!(d-l)!(d 



a 



^ {d-a + l){d-a + 2)---d d 



{2d-a){2d-a + l)---{2d-l) - \2 J 2d-l' 
Hence for a = [21og2 d+ 1~\ £ 0{logd) the second part of the claim follows. □ 

Remark 3. The statement remains true if we replace the probability 1 — 1/d by 
1 — 1/d'^ for any constant c > 1. 

If the roots are randomly distributed in the unit disk one should expect that the 
number of roots in a region is proportional to its area. The previous claim easily 
implies the following statement. 

Lemma 5. Let a polynomial with roots xi,...,Xij_ be randomly chosen in Vd or 
'Pd/Sd- Then there exists a constant Cd G 0(\ogd) such that with probability at least 
l~l/d the following holds true: every disk in C with area A = vrr^ contains at most 
k{A) points among xi, . . . ,Xd with 



kiA) 



CddA ifA>l/d; 
Cd otherwise. 



Proof. Without loss of generality we can assume that d= {2k + if for an integer 
k (the general case follows by enlarging C(2fc+i)2 by a bounded factor). Then the 
unit disk can be subdivided into d pieces as follows (compare Fig. [T]): the first 
piece is a disk with center and radius ro = 1/Vd; next, consider the annuli Ag 
bounded between circles around of radii (2s — l)ro and (2s + l)ro for s = 1, ... A:, 
and subdivide each annulus Ag into exactly 8s pieces of equal area by drawing 8s 
radial segments. Thus we construct exactly d pieces with equal area and diameters 
comparable with tq. By Lemma |4] it follows that each of the pieces contains at most 
0{logd) of the points xi, . . . ,Xd with probability at least 1 — l/d (in both cases of 
distinguishable and indistinguishable roots): in the case of distinguishable roots, the 
i-th digit of a d-digit number specifies the number of the piece containing the i-th 
root; in the other case, the same symmetries apply on both sides of the equality. 

Hence, the claim is true for that particular partition of the unit disk. This implies 
the general claim as follows. It is easy to see that each square of side length at most 
ro in the complex plane can intersect at most a constant number C of these pieces, 
where C does not depend on d. Consider a square S for which the unit circle is 
inscribed, for example the one with sides parallel to the real and imaginary axes. 
Subdivide it into d equal squares of side length ro (using the fact that d is a square 
and ro = 1/Vd). Then each of these smaller squares will intersect at most C pieces 
from the partition of the unit disk (some squares will not intersect any). Therefore 
each of the small squares contains at most C" log d points for some constant C" 
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which does not depend on d. Since each square of side length ro (possibly rotated) 
intersects at most 9 of these squares dividing S*, we conclude that each square of side 
length ro contains, with probability at least 1 — l/d, at most Clogd of the points 
xi, . . . , a;(i for C = 9C". If we group every 4 neighboring small squares (of side length 
ro) and repeat the argument, we get that each square of side length 2ro contains at 
most 4dogd points, and so on for squares of side length 4ro,8ro,.... Thus an 
arbitrary square of side length x G [2*^ro, 2'^+^ro] contains at most 2^'^+^Clogd < 
4C(a;^/rQ) logc? k, ACx^dlogd points since it is contained some square of side length 
2*^+^ro, and thus enlarging the constant by a factor of 4 the lemma will hold true for 
squares. Since each disk of radius r is contained in a square of side length 2r, the 
bound on the number of points in an arbitrary disk follows. □ 

Now we prove the following claim about the mutual distance for randomly dis- 
tributed points in the unit disk. 

Lemma 6. Let the polynomial p{z) be randomly chosen in Vd orVd/Sa- Then the 
mutual distance between any pair of its roots is at least l/d^~^^, for any fixed r) > 0, 
with probability at least 1 — l/d^"^ . 

Proof. First, note that the claim for a randomly chosen polynomial in Vd/ Sd follows 
from the claim for a randomly chosen polynomial in Vd- Choosing randomly a 
polynomial in Vd is equivalent to choosing randomly and independently its roots. 
For a positive number r, the probability pd^r that d uniformly and independently 
distributed points in the unit disk have mutual distance at least r is at least 

Pd,r > (1 - r-')(l - 2r2) ... (1 - (d - ly) 

(the unit disk has area tt, and after k roots are selected, the k + 1-st root must avoid 
an area of at most kirr"^; this has probability (tt — iTkr'^)/'K = 1 — kr"^). 
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Since log(l + x) > x/{l + x) for x > —1, we get 

log... > gfogd - kr^) > E ^ -^Sp ^ -^'l^ ^ ' 

fe=l /c=l 

where the last inequahty holds if dr^ < 1/2. Hence 

Pd,r > exp(-dV2) > 1 - d^r^ . 

If r = l/d^'^'^ (which satisfies dr^ < 1/2), then p^^^ > 1 — l/d"^"^ and thus the claim 
follows. □ 

Remark 4. The precise value of ij in this lemma is not very important to us, as 
eventually it will only affect constants in the bounds we obtain. 

An immediate corollary of the lemma is an upper bound on the distance of Zn to 
the closest root which guarantees quadratic convergence. 

Corollary 7. // the d roots are randomly chosen and Zn G with 2^'^ < l/8d^^'^ 
and T] > 0, then with probability at least 1 — l/d^^ the orbit of z^ converges to the 
closest root a, and log2 | log2e — 5| iterations of Zn are sufficient to get e-close to a. 

Proof. Indeed, if z„ £ Sk and a is the closest root to Zn, then |z„ — a| < 1/8^^+'' 
and for every root aj ^ a we have 

\zn-aj\ > \a-aj\-\a-Zn\ > l/(i^+'' - 1/8^^+'' > {8d + l)/8d'^^'^ > (4d + 3)|z„-a| 

(under the conditions of Lemma [g]). Therefore by [Schl Lemma 16], we need no more 
than log2 I log2 e — 5| iterations to get e-close to a. □ 

We combine the previous two lemmas in the following claim. 

Lemma 8. Let a polynomial with roots xi,...,X(i be randomly chosen in Vd or 
'Pd/ Sd- Then with probability Pd^ ^ — 0{d~'^^) (for fixed r] G (0, 1/2) ), the following 
two statements simultaneously hold true: 

Area Condition (AC): There exists Cd G Oilogd) such that every disk in C 
with area A = nr'^ contains at most k{A) roots among xi, . . . ,Xd with 



kiA) 



CddA ifA>l/d; 
Cd otherwise. 

Distance Condition (DC): The mutual distance between any pair of roots is 
at least l/d^+'i. 



Proof. We are interested in P(AC = true and DC = true), which equals 

1 - P{AC = false or DC = false) > 1 - P{AC = false) - P{DC = false). 

By Lemma [5] we have P{AC = false) < 1/d, and by Lemma [6] P(-DC = false) < 
1/d^'^. Hence the claim follows. □ 

3.2. Proof of the main theorem. In this section we will use the two conditions 
AC and DC to prove Theorem [T] While DC guarantees that proximity to a root 
implies fast convergence (Corollary [7]), AC gives a lower bound on the displacements 
along an orbit far away from the roots. More precisely, the following statement holds 
true. 
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Lemma 9. Suppose that the Area Condition in Lemm.a\^ holds true. If Zn G Sk H 
©2(0), then 

1 

" - (1 + 2C,)2^+i + IGnCdd ' 

where Cd £ O(logd). If Zn ID)2(0), then \zn — Zn+i\ > l/d. 

Proof. The fact that Zn G Sk means that the closest root, say a, is at distance 
for some c G (0.5, 1], and all the other roots satisfy |z„ — a^l > 

First suppose that Zn G Sk H 02(0). This implies that K > —2. Let := {z G 



: 2 



-fe-i 



< |z — < 2~ } for A; = —2, . . . ,K. Then all the roots are contained in 
|J^^_2Tfc. The Area Condition implies that there exists a constant G 0(logd) 
such that the number of roots in Tk is bounded by TrCdd^T^ for T:4r^ > l/d, and by 
Cd otherwise. Thus we have 

^ 1 2^ 



E 



1 



< 



1 



2^ 

< — + 

c 



a 

L0.51og2 TTdJ 



a. 



K 

E E 

k = — 2 oij^a 



1 



a. 



E 



E 



K 



+ 



E 



E 

fe=l+[0.51og2 vrdj "jT^a 



2^ Lo.5k^..^J2^^^^4_, 



< — + 
c 



E 

jt=-2 



2-fc 



K 

+ E ^"^2 

fc=l+[0.51og2 TTdJ 



fe+1 



Therefore 



< 2^+1 + IQ^Cdd + Cd2^+2 ^ 
1 



1 



> 



For the case Zn 152(0) we have 

I 1 ~ 



E 



< 



□ 



and so |z„ — Zn+i| > l/c?- 

Corollary 10. Suppose that the Area Condition in Lenima^holds true. Then there 
is a universal constant C such that the following statements hold for any Zn G S^. 



(1) If2^> l/d, then \zn - Zn+i\ 



> 



(2) // 1/8^2+^ < 2-^ < l/d, then |z„ 
Proof. For Zn G ©2(0), Lemma [o] gives 

I Zn Zn-\-l I ^ 



C 

d log d ' 



Zn+l\ > 



C 



If 2"^^ > l/d, i.e., 2*^"^^ < 2d, the denominator is at most 0{Cdd), so the displacement 
is at least C' /{dlogd) for some universal constant C (since Cd G 0{logd)). On 
the other hand, if l/Sd^^'' < 2~'^ < l/d and thus d < 2^ , the denominator is at 
most 0{Cd2^). In this case, Cd G O(logd) = 0(A;), so the displacement is at least 
C" /{k2^). Therefore the claim follows if we take C = min{C", C"}. 
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Finally, Zn 02(0) implies k < —1. Again Lemma [9] gives |z„ — Zn+i| > ^/d, and 
thus we finish the proof by possibly decreasing the constant C. □ 

The final step towards proving our main result is in the following theorem. 

Theorem 11. Let the polynomial p{z) he randomly chosen in Vd or Vd/Sd and let 
(zn) be an R-central orbit converging to a root a with du{zo,zi) < r for r < 21ogd. 
Then with probability > 1 — 0{d~'^'^) (for fixed rj € (0, 1/2) ), the required number 
of iterations for zq to get e-close to a is 

O {d^ log^ dR^ + log I loge - 5|) . 

Before proceeding to the proof of this statement, we will outline the main idea. 
As in [Sch], we construct "thick" curves connecting orbit points z„ and Zn+i that 
use up certain area contained in a bounded domain. Far from the root, two curves 
corresponding to z„ and Zm are disjoint provided that |n — m| > 2r + 6. A lower 
bound on the area of the "thick" curves gives an upper bound on the number of 
iterations. Also, near the root the orbit enters the domain of quadratic convergence 
where only a few iterations are sufficient to approximate the root. 

More precisely, let 99: f7 — t- D be the Riemann map with (p{a) = considered in 
|Schl Section 5]. If < 1/2 ("region of fast convergence"), then according to 

|Schl Lemma 11] we need only log2 | log2 e — 5| iterations to get e-close to the root 
a. For orbit points with (/9-images having absolute values greater than e^^^ — 1, we 
can prove the following. 

Lemma 12. For every n with \(p{zn)\ > e^/^ — 1, there are open connected subsets 
Vn C -D2_R+2(0) with Zn, Zn+i G dVn and \ Vn\ > \zn — z„+ip/2r, having the following 
property: whenever n and m are such that mm{\ip{zn)\,\(p{zm)\} > e^/^ — 1 and 
\n — m\ > \2t + 6] , we have Vn n Vm = 0- 

Proof. Let 7: [0, s] — )■ C/ be the hyperbolic geodesic within U connecting z„ to Zn+i- 
For each z = 7(t), let rj{t) be the Euclidean distance from ^{t) to dU , and let Xt 
be the straight line segment (without endpoints) perpendicular to 7(t) of Euclidean 
length rj{t), centered at 7(t). Let Vn := U^g^Q^-^Xj. Then all Vn are open and 

connected and Zn,Zn+i G dVn, and the area of Vn is at least \zn — Zn+iP/2r: this 
follows as in |Scht Lemma 9] (in this reference, the areas restricted to certain domains 
Sk are calculated; omitting this restriction, we obtain the result we need, and the 
computations only get simpler). Moreover, the orbit {zn) is i?-central and the unit 
disk contains other roots than a, and hence the length of ^{t) for t G [0, s] is bounded 
by ii+ 1. This implies that all pieces Vn are contained in -D2/?+2(0) by construction. 

The fact that Vn H Vm are disjoint when |n — m| > 2r + 6 is proved in |Schl 
Lemma 12] (again for restricted domains, but this is immaterial for the proof). □ 



Proof of Theorem 11. We only need to consider iteration points whose images under 
if have absolute values at least e^^^ — 1. Also, by Lemma [sj the conditions AC and 
DC hold true with probability > 1 — 0{d^'^^). 



Choose M so that 2 — 1 > 2R + 2. We distinguish the following three cases. 
The Far Case: we have Zn G Sk with 2"^^ > 1/c?. By Corollary 
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_ , . . '1) we have 

\Zn ^n+l 

I ^ Lemma 02^ says that any Newton iteration Zn ^ Zn+i 



with Zn G Sk needs area at least 

\Zn ~ Zn+1\ 



> 



2r - 2r(i2log^d 
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Moreover, the pieces of area for the iterations i— )• z^+i and Zn' ^ -Zn'+i 
are disjoint provided that n — n' > 2t + 6, and ah these pieces of area are 
contained in the disk £)2_R+2(0) with R universaUy bounded. 

The total number of such iterations I?2_R+2(0) can accommodate is thus at 
most 

C (f{\ogdfT\2T + Q\R^ 

for a universal constant C . 
The Intermediate Case: we have Zn G Sk with 1/8^2+'' < 2'^ < l/d. Then 



log2 d<k<2> + {2 + r]) log2 d. By Corollary 10 (2) we have |z„ — Zn+i\ > 



C/k2^. Thus by |Scht Proposition 13], the set contains at most 



7Td2-^''k-^{2k + Cf i2T+^) \2t + 61 



TrdC^'^{2k + Cf{2T + ^] \2t + 6] 



< T:dC-'\<o + (4 + 2r?) log2 d + Cf[2T + ^rr^ ) \2t + 6] 



2k) ' ' ^2 

2k^ 

2 log2 d 

< C'dlog^ d(2T + 1) [2r + 6] 

orbit points for some universal constant C" . There are 3+(l+r/) log d possible 
values of k in the Intermediate Case, so IJ;, Sk (for all k in the Intermediate 
Case) can accommodate at most 

(1 + r])C"dlog^ d{2T + 1) \2t + 6] 

orbit points for some universal constant C" . 
The Near Case: we have Zn G Sk with 2~'' < l/8d'^+i. By Corollary [7| a is 
the closest root to Zn and we need log2 | log2e — 5| iterations to get e-close 
to it. 

Since r G 0{logd) and the Far Case dominates the Intermediate Case, the claim 
follows. □ 

We now conclude the main statement. 

Proof of Theorem^ By Theorem [3j for each root there is a starting point satisfying 
the conditions of the theorem. In particular, these orbits are i?-central for a univer- 
sally bounded value of R. Note that the d roots have to compete for the available 



area in D2i?,+2(0). Since the estimates in the proof of Theorem 11 are based on the 
area (except for the Near Case where the orbit gets to the region of quadratic con- 
vergence), we get the same estimate for the combined number of iterations (except 
that the estimate log | loge| applies for each root separately, thus it is multiplied by 
d). □ 

Remark 5. This result is close to optimal in the sense that the power of d cannot 
be reduced for our universal set of starting points that is bounded away from the 
unit disk. The reason is that outside the unit disk Np is conjugate to the linear 
map w I—)- by |HSS1 Lemma 4], so at least 0{d) iterations are required for each 
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"good" starting point to get close to the unit disk where the roots are located, and 
at least 0{(f) for all the d starting points combined. 
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